A Framework for Sequential Planning in Multi-Agent Settings

Authors

Piotr J. Gmytrasiewicz

Prashant Doshi

Abstract

This paper extends the framework of partially observable Markov decision processes (POMDPs) to multi-agent settings by incorporating the notion of agent models into the state space. Agents maintain beliefs over physical states of the environment and over models of other agents, and they use Bayesian updates to maintain their beliefs over time. The solutions map belief states to actions. Models of other agents may include their belief states and are related to agent types considered in games of incomplete information. We express the agents’ autonomy by postulating that their models are not directly manipulable or observable by other agents. We show that important properties of POMDPs, such as convergence of value iteration, the rate of convergence, and piecewise linearity and convexity of the value functions, carry over to our framework. Our approach complements a more traditional approach to interactive settings, which uses Nash equilibria as a solution paradigm. We seek to avoid some of the drawbacks of equilibria, which may be non-unique and cannot capture off-equilibrium behaviors. We do so at the cost of having to represent, process, and continually revise models of other agents. Since the agent’s beliefs may be arbitrarily nested, the optimal solutions to decision-making problems are only asymptotically computable. However, approximate belief updates and approximately optimal plans are computable. We illustrate our framework using a simple application domain, and we show examples of belief updates and value functions.
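The Bayesian update over physical states and models of other agents mentioned in the abstract can be illustrated with a small sketch. It is not the paper's algorithm: it assumes a deliberately simplified setting in which each model of the other agent is a fixed action distribution (so the models themselves are not updated), and every name in it (`update_belief`, `policy_j`, the two-state toy domain, the 0.85 hearing accuracy) is hypothetical and chosen only for illustration.

```python
from itertools import product


def update_belief(belief, action_i, obs_i, states, actions_j, policy_j, T, O):
    """One Bayesian update of agent i's belief over (state, model) pairs.

    Simplifying assumption: each model of agent j is a fixed distribution
    over j's actions, so the model component does not change over time.
    """
    models = {m for (_, m) in belief}
    new_belief = {}
    for s_next, m in product(states, models):
        total = 0.0
        for s in states:
            prior = belief.get((s, m), 0.0)
            for a_j in actions_j:
                # Weight by how likely model m makes agent j choose a_j,
                # then by the transition and observation probabilities.
                total += (prior * policy_j[m][a_j]
                          * T(s, action_i, a_j, s_next)
                          * O(s_next, action_i, a_j, obs_i))
        new_belief[(s_next, m)] = total
    norm = sum(new_belief.values())
    if norm == 0.0:
        raise ValueError("observation has zero probability under this belief")
    return {key: value / norm for key, value in new_belief.items()}


# Hypothetical toy domain: the state is "L" or "R"; agent j either listens
# (state unchanged) or opens a door (state resets uniformly at random).
STATES = ["L", "R"]
ACTIONS_J = ["listen", "open"]
POLICY_J = {
    "cautious": {"listen": 1.0, "open": 0.0},
    "rash": {"listen": 0.5, "open": 0.5},
}


def T(s, a_i, a_j, s_next):
    if a_j == "open":
        return 0.5
    return 1.0 if s_next == s else 0.0


def O(s_next, a_i, a_j, obs):
    # Agent i hears the correct side with probability 0.85.
    return 0.85 if obs == s_next else 0.15


prior = {(s, m): 0.25 for s in STATES for m in POLICY_J}
posterior = update_belief(prior, "listen", "L", STATES, ACTIONS_J,
                          POLICY_J, T, O)
```

In this sketch the belief is a dictionary keyed by (physical state, model) pairs, which mirrors the idea of folding agent models into the state space. A fuller treatment would also revise the belief state nested inside each intentional model, which is where the arbitrary nesting discussed in the abstract arises.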

Related resources

Cooperative Path Planning of Dynamical Multi-Agent Systems Using Differential Flatness Approach

This paper discusses a design methodology of cooperative path planning for dynamical multi-agent systems with spatial and temporal constraints. The cooperative behavior of the multi-agent systems is specified in terms of the objective function in an optimization formulation. The path of achieving cooperative tasks is then generated by the optimization formulation constructed based on a differen...

A Framework for Optimal Sequential Planning in Multiagent Settings

Research in autonomous agent planning is gradually moving from single-agent environments to those populated by multiple agents. In single-agent sequential environments, partially observable Markov decision processes (POMDPs) provide a principled approach for planning under uncertainty. They improve on classical planning by not only modeling the inherent non-determinism of the probl...

Cooperative Epistemic Multi-Agent Planning for Implicit Coordination

Epistemic planning can be used for decision making in multi-agent situations with distributed knowledge and capabilities. Recently, Dynamic Epistemic Logic (DEL) has been shown to provide a very natural and expressive framework for epistemic planning. We extend the DEL-based epistemic planning framework to include perspective shifts, allowing us to define new notions of sequential and condition...

Learning Others' Intentional Models in Multi-Agent Settings Using Interactive POMDPs

Interactive partially observable Markov decision processes (I-POMDPs) provide a principled framework for planning and acting in a partially observable, stochastic and multiagent environment, extending POMDPs to multi-agent settings by including models of other agents in the state space and forming a hierarchical belief structure. In order to predict other agents’ actions using I-POMDP, we propo...

Convex Coverage Set Methods for Multi-Objective Collaborative Decision Making (Doctoral Consortium)

My research is aimed at finding efficient coordination methods for multi-objective collaborative multi-agent decision theoretic planning. Key to coordinating efficiently in these settings is exploiting loose couplings between agents. We proposed two algorithms for the case in which the agents need to make a single collective decision: convex multiobjective variable elimination (CMOVE) and varia...

Journal:

J. Artif. Intell. Res.

Volume 24

Publication date: 2004
